Chapter 8 – Theoretical Biophysics  363

Thus, the P(A) is the prior of A, P(B) is the evidence, P(B|A) is the likelihood for B given A,

and P(A|B) is the posterior for A given B (i.e., the probability for this occurring given prior

knowledge of the probability distributions of A and B).

Bayes’ theorem provides the basis for Bayesian inference as a method for the statistical

testing of different hypotheses, such that P(A|B) is then a specific model/​hypothesis. Thus,

we can say that

(8.96) P

P

P

P

P

model|data

data model

model

data

data model

(

) =

(

)

(

)

=

(

|

)

(

|

) P

P

P

model

data model

model

allmodel

(

)

(

)

(

|

)

P(data|model) is an experimentally observed dataset, P(model) is a prior for the given model,

and P(model|data) is the resultant prediction for posterior probability that that model

accounts for these data. Thus, the higher the values of P(model|data), the more likely it is that

the model is a good one, and so different models can thus be compared with each other and

ranked to decide which one is the most sensible to use.

For using Bayesian inference to discriminate between different models of diffusion for

single-​particle tracking data, we can define the following:

1 The likelihood, P(d|w,M), where d represents the spatial tracking data from a given

biomolecule and w is some general parameters of a specific diffusion model M. This is

the probability distribution of the data for a given parameter from this model.

2 The prior, P(w|M). This is the initial probability distribution function to any condi­

tioning by the data; priors represent any initial estimate of the system, such as distri­

bution of the parameters or the expected order of magnitude.

3 The posterior, P(w|d,M). This is the distribution of the parameter following the con­

ditioning by the data.

4 The evidence, P(d|M). This acts as a normalization factor and is a portable unitless

quantity.

One such analytical method that employs this strategy in inferring modes of diffusion

from molecular tracking data is called “Bayesian ranking of diffusion” (BARD). This uses

two stages in statistical inference: first, parameter inference, and second, model selec­

tion. The first stage infers the posterior distributions about each model parameter, which

is defined as

(8.97)

P w d M

P d w M P w M

P d M

( | ,

)

( |

,

) ( |

)

( |

)

=

The second stage is model selection:

(8.98)

P M d

P d M P M

P d

(

| )

( |

)

=

(

)

( )

P(M|d) is a number that is the model posterior, or probability. P(M) is a number that is the

model prior (model priors are usually flat, indicating that the initial assumption is that all

models are expected equally), P(d|M) is a number that is the model likelihood, and P(d) is a

number that is a normalizing factor that accounts for all possible models. This now generates

the posterior (i.e., probability) for a specific model. Linking the two stages is the term P(d|M),

the model likelihood, which is also the normalization term in the first stage. Comparing

calculated normalized P(d|M) values for each independent model then allows each model

to be ranked from the finite set of diffusion models tried in terms of the likelihood that it

accounts for the observed experimental tracking data (e.g., the mean square displacement vs

τ values for a given tracked molecule).